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The  problem  of  selecting  the  normal  population  with  the  largest 
population  mean  when  the  populations  have  a common  known  variance  is 
considered.  A two-stage  procedure  is  proposed  which  guarantees  the  same 
probability  requirement  using  the  indifference-zone  approach  as  does  the 
single-stage  procedure  of  Bechhofer  [1954].  The  two-stage  procedure  has 
the  highly  desirable  property  that  the  expected  total  number  of  observa- 
tions required  by  the  procedure  is  always  less  than  the  total  number  of 
observations  required  by  the  corresponding  single-stage  procedure, 
regardless  of  the  configuration  of  the  population  means.  The  saving  in 
expected  total  number  of  observations  can  be  substantial,  particularly 
when  the  configuration  of  the  population  means  is  favorable  to  the 
experimenter1.  The  saving  is  accomplished  by  screening  out  "non-contending" 
populations  in  the  first  stage,  and  concentrating  sampling  only  on 
"contending"  populations  in  the  second  stage.  The  two-stage  proced  re 
can  be  regarded  as  a composite  one  which  uses  a screening  subset -type 
approach  (Gupta  [1955],  [1965])  in  the  first  stage,  and  an  indifference- 
zone  approach  (Bechhofer  [1954])  applied  to  all  populations  retained  in 
the  selected  subset  in  the  second  stage.  Constants  to  implement  the 
procedure  for  various  k and  Fft  are  provided,  as  are  calculations 
giving  the  saving  in  expected  total  sample  size  if  the  two-stage  procedure 
is  used  in  place  of  the  corresponding  single-stage  procedure. 


1.  Introduction  and  Summary 


In  many  practical  situations  a statistician  is  faced  with  the  problem 
of  designing  an  experiment  to  select  one  (or  more)  out  of  k 2 possible 
competing  categories.  Typically  the  categories  (populations)  are  char- 
acterized by  a real-valued  parameter,  and  the  experimenter  is  interested 
in  selecting  the  population  having  the  largest  (or  smallest)  parameter 
value.  This  population  is  referred  to  as  the  "best"  population.  Thus, 
for  example,  the  medical  research  worker  might  be  studying  the  response  of 
patients  to  different  types  of  analgesic  drugs  in  which  case  his  interest 
might  lie  in  selecting  that  drug  which  produces,  on  the  average,  the  largest 
period  of  time  without  pain,  or  the  agronomist  might  be  conducting  field 
trials  with  different  varieties  of  grain  in  which  case  his  purpose  might 
be  to  select  that  variety  which  produces,  on  the  average,  the  largest  yield 
per  acre. 

Procedures  for  achieving  such  objectives  have  received  considerable 
attention  in  recent  years.  Various  probability  distributions  have  been 
postulated  as  being  appropriate  to  model  these  and  other  real-life  problems, 
and  several  statistical  formulations  of  these  problems  have  beon  proposed, 
and  associated  statistical  selection  procedures  devised.  Recent  reviews 
of  the  literature  with  particular  reference  to  the  normal  means  problem 
appear  in  Wetherill  and  Ofosu  [1974]  and  Bechhofer  [1975].  The  present 
paper  continues  the  study  of  the  normal  means  problem,  and  explores  in 
depth  a new  approach  which  has  highly  desirable  properties.  This  same 
approach  is  also  applicable  to  the  normal  variances  problem. 

The  statistical  formulation  of  the  problem  is  given  in  Section  2.  In 
Section  3 we  sketch  the  relevant  history  of  the  normal  means  problem,  and 
indicate  the  virtues  and  drawbacks  of  the  various  procedures  which  have 


been  proposed  to  deal  with  the  problem;  the  reader  is  thus  enabled  to  under- 
stand the  role  that  our  proposed  procedure  plays.  The  procedure  itself 
as  well  as  the  design  criterion  that  we  adopt  are  described  in  Section  4. 

The  main  analytical  results  are  contained  in  Sections  5 and  6 which  deal 
with  the  probability  of  a correct  selection  and  the  expected  total  sample 
size,  respectively;  Section  5.2  discusses  a key  unsolved  problem  (that  of 
determining  the  so-called  least  favorable  configuration  of  the  population 
means)  associated  with  the  procedure,  while  Section  5.3  contains  a strategem 
which  permits  us  to  bypass  this  difficulty  (at  the  expense  of  some  loss  of 
efficiency  of  our  procedure).  In  Section  7 we  formulate  the  problem  that 
we  must  solve  to  obtain  design  constants  to  implement  our  procedure;  tables 
of  these  constants  are  provided  in  Section  8.  The  performance  of  our  two- 
stage  procedure  relative  to  that  of  the  best  competing  single  stage  procedure 
is  studied  in  Section  9,  and  is  shown  to  be  highly  satisfactory.  We  con- 
clude in  Section  10  with  suggestions  for  future  research  in  this  area. 


2.  Preliminaries 


2.1  Assumptions 

Let  r.£  (1  < i < k)  denote  a normal  population  with  unknown  mean 

ik  and  known  variance  a2,  and  let  ft  = {p^  = (p^, . . . ,p^)  | -®  < p.  < ® (l^ifLk) 
be  the  parameter  space  of  the  p^.  Denote  the  ranked  values  of  the  p..  by 


y[l]  - •**  « w[k3*  and  let  5i,j  = W[i] 


Cj] 


. We  assume  that  the  experi- 


menter has  no  prior  knowledge  concerning  the  pairing  of  the  n with  the 

(1  4 i,j  ^k).  Let  n^j  denote  the  population  associated  with  p^-j. 
Suppose  that  4[k_r]  K Mrk-r+l]  = y[k]  for  some  r ^ = r =k*  where  we 


define  Pj-^  = then  any  one  of  the  r populations  n(k_r+j) 


U < j <r) 


is  regarded  as  "best." 


.yJMu: 


1 

3 

2.2  Coal  and  probability  requirement 

The  goal  of  the  experimenter  is  to  select  a best  population.  This 
event  is  referred  to  as  a correct  selection  (CS).  The  experimenter  restricts 
consideration  to  procedures  (P)  which  guarantee  the  probability  requirement 


!' 


i 


ycslP}  > F*  V U C n(6*) 


(2.1) 


where  {6*,P*}  0 < 6*  < ®,  i/k  < p*  < i are  specified  prior  to  the  start 

of  experimentation,  and 


«(«*)  = {u  r fi|<5k  k_1  > 5*}.  (2.2) 

We  refer  to  fi(6*)  as  the  preference  zone  for  a CS  and  to  fiQ(6*)  = Q-f!(6*) 
as  the  associated  indifference  zone.  The  formulation  (2.1)  is  called  the 
indifference-zone  approach. 


3.  Background:  Single-stage  and  sequential  procedures 

The  indifference- zone  approach  as  applied  to  the  normal  means  ( common 
known  variance)  problem  has  received  considerable  study.  Bechhofer  [1954] 
proposed  a single-stage  procedure  which  guarantees  (2.1)  •,  Hall  [1959] 
showed  that  among  single-stage  procedures  this  procedure  is  "most  economical" 
and  Eaton  [1967]  proved  that  it  has  additional  desirable  decision  theoretic 
properties.  Bechhofer,  Kiefer,  and  Sobel  [1968]  (see  also,  Bechhofer  and 
Sobel  [1954])  proposed  an  open  sequential  procedure  without  elimination 
which  guarantees  (2.1).  Paulson  [1964]  proposed  a closed  sequential 
procedure  with  permanent  elimination  which  also  guarantees  (2.1);  Fabian 
[1974a]  (see  also,  Fabian  [1974b]  and  Lawing  and  David  [1966])  showed  how 
Paulson's  procedure  could  be  modified  to  improve  its  performance  characteristics. 


■ 


¥ V 


The  single-stage  procedure  (P^)  °*  Bechhofer  requires  a common 

sample  size  n per  population  which  is  chosen  in  such  a way  that  (2.1) 
is  guaranteed  even  when  y^-,  - y^  = 6*  (1  4 i < k-1),  this  being  the 

so-called  least  favorable  (EF)  configuration  of  the  population  means. 

However , the  procedure  is  conservative  in  that  if,  unknown  to  the  experi- 
menter, y^^  “ v‘[i]  i 6*  (1  i i 4 k-1)  with  strict  inequality  for  at 

least  one  i- value — in  particular  if  - pi -j  >:>  then 

P^[CS|P^}  > P*  for  the  actual  y_  C fi(5*)  which  the  experimenter  has 
encountered.  If  this  is  the  situation  he  may  have  greatly  overprotected 
himself,  the  overprotection  having  been  purchased  by  the  use  of  a much 
larger  r.- value  than  would  have  been  necessary  had  the  true  y -values  been 
known. 

Unlike  single-stage  procedures,  multistage  or  sequential  procedures 
provide  information  concerning  the  true  but  unknown  y-values  as  sampling 
proceeds . 

The  sequential  procedure  (P  ) of  Bechhofer,  Kiefer,  and  Sofcel  takes 

bl 

a single  vector  of  observations  at  each  stage  of  experimentation.  Here  the 

number  of  stages  (Nr  ) to  terminate  experimentation  is  an  unbounded  r.v. 

(For  P_  a vector  consists  of  one  observation  from  each  of  the  k popu- 
S1 

lations.)  In  addition  to  guaranteeing  (2.1)  when  the  population  means  are 

in  the  LF-configuration,  it  also  reacts  to  favorable  configurations  of  the 

population  means  and  thereby  tends  to  terminate  experimentation  early 

resulting  in  E {HQ  |P  }-values  which  are  smaller  than  n (c.f.,  B-K-S 
- bl  S1 

[1968],  Section  12.8.1).  (Throughout  this  paper  n will  denote  the  single- 
stage  sample  size  for  P^.)  However,  if  P*  is  sufficiently  close  to 


unity  and  if  y^^  - y^  ^ j < 6* — in  particular,  if  = then 

E {H„  |P„  } > n for  the  actual  y P Q (6*)  which  the  experimenter  has 

H.  bl  bi  0 


encountered.  However,  P0  does  have  a practical  disadvantage:  It  is 

bl 

open-ended,  i.e.,  although  N,,  is  finite  w.p.  1,  it  is  unbounded;  this 

bl 

latter  fact  may  inhibit  or  prevent  the  use  of  the  procedure  in  certain 
situations . 

The  sequential  procedure  (F  ) of  Paulson  which  takes  a single  vector 

^ r\ 

of  observations  at  each  stage  of  experimentation,  and  for  wh.lch  the  number 

of  stages  (N0  ) to  terminate  experimentation  is  a bounded  r.v.  (<_  M), 
b2 

is  an  adaptive  procedure.  (For  Pc  a vector  consists  of  one  observation 

"2 

from  each  of  the  populations  still  retained  "in  contention"  by  the 

procedure  at  stage  j (1  < j < M,  k = k > k0  >...>_  k >.  2) , those 

not  retained  at  stage  j being  permanently  eliminated,  here  the  k. 

j 

(2434  M)  are  r.v.'s.)  In  addition  to  guaranteeing  (2.1)  when  the 

population  means  are  in  the  LF-configuration,  it  also  reacts  to  favorable 

configurations  of  the  population  means,  eliminating  from  further  sampling 

populations  which  are  indicated  as  not  being  in  contention,  and  in  general 

terminating  experimentation  early  resulting  in  E ,'P  } -values  which 

- °2  S2 

are  less  than  n.  (See  Ramberg  [1966D.)  Of  considerable  interest  is  the 
fact  that  if  j = y['K]  an^  P*  is  close  to  unity,  then 

E {N  |P  } < E {!'  |P  } for  the  actual  p C which  the  experimenter 

E b2  b2  - bl  bl  0 

has  encountered.  The  quantity  E {T„  !P0  },  where  T = total  number  of 



observations  to  terminate  experimentation,  behaves  similarly  w.r.t. 

E (T  IP,,  }.  Since  N < M we  have  E {N_  |P„  } < M and  E (T_  jP„  } < kM 
V_  S'  Sx  S2  “ U S2'  S2  E S2  S2 

for  all  g S2;  the  bound  M is  a function  of  k,  {6*,?*}  and  also  of 

a design  parameter  A (0  < A <.  6*/2)  which  is  fixed  by  the  experimenter 

before  the  start  of  experimentation. 

Even  though  P_  and  P have  certain  highly  desirable  properties 
1 b2 

relative  to  P^,  both  being  adaptive  and  therefore  being  able  to  capitalize 


oil  favorable  configurations  of  the  population  means,  both  have  the  drawback 
that  they  nay  require  many  stages  to  terminate  experimentation.  Such 
procedures  are  often  very  costly  to  implement , and  in  some  experimental 
situations  may  be  completely  impractical,  e.g,  , in  agriculture  where  only 
one  stage,  i.e.,  vector  of  observations,  can  be  obtained  each  growing 
season,  the  number  of  stages  (years)  to  terminate  experimentation  would  be 
prohibitively  large. 

Thus , in  this  present  paper  we  study  a two-stage  procedure  which  takes 
a fixed  number  of  vectors  of  observations  at  each  stage  of  experimentation. 
The  procedure  guarantees  (2.1)  when  the  population  means  are  in  the  LF- 
configuration.  It  is  adaptive,  eliminating  from  further  sampling  in  the 
second  stage  populations  which  are  indicated  as  not  being  in  contention 
after  the  first  stage,  and  in  general  terminating  experimentation  after 
the  first  stage  if  the  configuration  of  the  population  means  is  very  favor- 
able, e.g.,  ~ lir^  ^-|  >>  6*.  In  addition  this  procedure  is  designed 

to  be  minimax  within  a certain  class  of  two-stage  procedures. 

4.  A twe-stage  procedure  (?2) 

U-3  propose  a two-stage  procedure  = P^n^n^h)  which  depends  on 
non-negative  integers  n^,n2  “nc^  a real  constant  h 0 which  are  deter- 
mined prior  to  the  start  of  experimentation.  The  constants  (n^,n2>h) 
depend  on  k and  and  are  chosen  so  that  P 2 guarantees  (2.1) 

and  possesses  a certain  minimax  property. 

Procedure  P 2 ; 

1.  In  the  first  stage  take  n^  independent  obsei'vations 
(1  < ] < n^)  from  IK  (1  4 i < k),  and  compute  the 


ij 


k 


7 


ssrnpxG  mfcciiis 

-(  j 

x: 

i 

xl^  = max 
Ck3  l<i<k 

x<1}. 

1 

{1,2,. ..,k) 

where 

! ’ ' 

I'J 


Let 


j=l 

Determine  the  subset  I of 

I = {i|x^1)  > xl.1^  - h},  and 

let  II  denote  the  associated  subset  of  {IT.  ,II_ , . . . ,IL  } 
1 JL  Z.  K 

a)  If  11^  consists  of  one  population,  stop  campling 

and  assert  that  the  population  associated  vrith 

Xp,  is  best . 

[k] 

b)  If  11^  consists  of  more  than  one  population 
proceed  to  the  second  stage. 


2.  In  the  second  stage  take  n„  additional  independent 

c(2) 

\Lj  '“L  = J = 1‘2 

and  compute  the  cumuT.at  ive  sample  means 


observations  X^i  (1  < j < n,)  from  each  population 
in  II . _ 

no  <0\ 

Assert 


2 X^.V)/(n ,+n  ) for  i f I, 


X = ( l X t l "ij  " '“i ‘ “2 
3=1  J 3=1 

that  the  population  associated  with  max  X.  is  best. 

i€I  1 


(4.1a) 


(4.1b) 


Remark  4.1:  This  procedure  had  been  proposed  previously  by  Cohen  [1959] 
and  Alam  [1970].  Due  to  analytical  and  computational  difficulties,  most 
of  their  work  was  limited  to  the  special  case  k = 2. 


Remark  4.2:  If  h = 0 (h  = °»)  the  two-stage  procedure  P reduces  to 

Bcchhofer’s  [1954]  single-stage  procedure  P^  with  single-stage  sample  size 
n = n^  (n^+n^)  per  population.  Also,  the  rule  determining  I in  (4.1a) 
is  of  the  type  proposed  by  Gupta  [1950,1965]  in  his  subset  selection 
procedure . 


There  are  an  infinite  number  of  combinations  of  (n^n^h)  which  for 
any  k and  {<5*,P*}  will  exactly  guarantee  (2.1),  and  different  design 
criteria  lead  to  different  choices.  In  the  next  sections  we  consider  two 


of  these  criteria. 


8 


Let  S'  denote  the  cardinality  of  the  set  I in  (4.1a),  and  let 


S 


0 if  S'  = 1 


S'  if  S'  > 1. 


(4.2) 


Then  the  total  sample  size  required  by  P^n^n^h)  is 

T = kn2  + Sn2-  (4.3) 

Let  E^{T | P2>  denote  the  expected  total  sample  size  for  P^n^n^h)  under  t£. 
4 . 1 R-miniroax  design  criterion 

The  design  criterion  proposed  by  Alam  [1970]  is  the  following:  For 

given  k and  specified  {6*,P*}  choose  (n  ,n_,h)  to 

minimize  Sup  E {t|P„} 

HfJUfi*)  2 

subject  to  Inf  P {CS|P  } >.  P*,  (4.4) 

yffl(6*)  2 

where  n^»n2  are  non-negative  integers  and  h ^ 0. 

We  denote  by  (n, ,n„,h|R„)  the  exact  solution  to  (4.4),  and  by  P„(R^)  the 

X Z hi  Z ju 

procedure  using  this  solution.  The  R-minimax  criterion  in  which  minimization 

takes  place  over  a restricted  portion  of  ft  insures  that  E^{T|P2(R^)}  <_ kn 

V y f 8(6*)  for  any  given  k and  specified  {6*,P*>.  However,  it  ignores 

what  can  happen  to  E { T j P ( R ) > if,  unknown  to  the  experimenter, 

|i  Z hi 

y f 8g(6*).  Indeed,  for  y^-j  = y^ 


it  is  possible  to  have 


8 


Let  S'  denote  the  cardinality  of  the  set  I in  (4.1a),  and  let 


0 if  S'  = 1 


S'  if  S'  > 1. 


(4.2) 


Then  the  total  sample  size  required  by  P^n^ ,n2?h)  is 


T = knj  + Sn2> 


(4.3) 


Let  E {T|P2>  denote  the  expected  total  sample  size  for  P2(n^,n2,h)  under  u_. 
4.1  R-minimax  des.ign  criterion 

The  design  criterion  proposed  by  Alam  [1970]  is  the  following:  For 

given  k and  specified  {6*,P*}  choose  (n^n^h)  to 


minimize  Sup  E {T|P  } 

ufn(6*)  H- 


subject  to  Inf  P (CSjP  } ^ P*, 
|if«(6*)  - 


(4.4) 


where  ni>n2  are  non-negative  integers  and  h >.  0. 


We  denote  by  (n^,n2,h|RE)  the  exact  solution  to  (4.4),  and  by  P^R^)  *ke 
procedure  using  this  solution.  The  R-minimax  criterion  in  which  minimization 
takes  place  over  a restricted  portion  of  fi  insures  that  E^{t|P2(R^,)}  <.  kn 
V f fi( 6*)  for  any  given  k and  specified  {6*,P*}.  However,  it  ignores 
what  can  happen  to  E {T | P (R  )}  if,  unknown  to  the  experimenter, 

P Li 

y_  C fJQ(6A).  Indeed,  for  p^  = pj-^j  it  is  possible  to  have 


»>■*  • 
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E {T|P2(Re)>  » kn  for  P*  sufficiently  close  to  unity  (as  happens  when 
E^{T}  of  the  Wald-Girshick  SPRT  is  compared  to  the  total  single-stage 
sample  size  which  guarantees  the  same  probability  requirement.  See 
Bechhofer  [i960]  and  B-K-S  [1968].  Section  12.8.1),  It  is  to  guard  against 
this  latter  undesirable  possibility  that  we  propose  the  design  criterion 
described  below. 

4 . 2 U-minimax  design  criterion 

Our  design  criterion  is  the  following:  For  given  k and  specified 

{f*,P*}  choose  (n^n^.h)  to 

minimize  Sup  E {T|P  } 
urn  - 2 

subject  to  Inf  P {CS|P„}  >.  P*,  (4.5) 

prf2(S*)  -- 

where  nj_'n2  are  non~neSaf^ve  integers  and  h > 0, 

We  denote  by  (n,  ,n,,,h  |u„)  the  exact  solution  to  (4.5)  and  by  P^dJ^) 

X Z Li  Z u 

the  procedure  using  this  solution.  Our  U-minimax  criterion  (4.5)  in  which 
minimization  takes  place  over  the  unrestricted  parameter  space  fi  insures 
that  E (t[P  (U  )}  < kn  V g f fi  for  any  given  k and  specified 

]i  Z £j 

in  this  sense  P^U^)  un^form-l-y  4.)  superior  to  P^. 

As  the  first  step  in  determining  (n^^jhlRj,)  or  (n^,n0>hjUF)  we 
find  an  exact  analytical  expression  for  P^{CS|P2>. 


■ UMi 
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5.  Probability  of  a correct  selection  for  P^ 

5.1  General  expression  for  P^{CS | P^} 

Our  result  concerning  a general  expression  for  P^{CS|P2)  is  summarized 
in  the  following  theorem: 

Theorem  5.1 : For  any  y_  f Q we  have 


P {CS|P_}  = 
U 2 


' -ao 


V 
L 

sfS  ■'-« 


f x+f  . '/r\  /a  i 

H I 4>[y  + (x-z)(p/q)1/2  + -f  (|)1/2]d<Kz) 

ifs]  ■*x+(dk^-h)/n1/a 


x n $[x  + (6  .-h)v/n  /a]d$(y)d*(x) 

i*s 


k-1 


<00  *00 

x-6,  .nT,  /a 
r k]  1 

J — CO  J —CO 

•*x-(5,  ^+h)*^7  /o  ‘ 

S «*• 

(5.1) 


1 /2  1/2 
y+(x-u)(p/q)  -S^jCm/q)  /o 


(f  x-6../n  /a 

Jf  11 

ifs  ^*x-(5^j+h)^T^/o 


x ( n $[x  - (6. .+h)/n  /c])d$(v)d4(u) 
i£s  13 


4>[v  + (u-z)(p/q)^2  + ^ki^m/q)”*"^  /o]d$(z) 


d$(y)d|I>(x) , 


where  4>(  • ) is  the  standard  univariate  normal  cdf,  and 


S = the  collection  of  all  subsets  of  {1,2 , . . . ,k-l) , 


Sj  = the  collection  of  all  subsets  of  (1,2,. . . ,j-l,j+l,. . . ,k-l}. 


in  — 9 


p = n^m,  q = n2/m. 


Proof:  Let  X<‘1; 


= l X,(}}  /n  and  X , . , = )'  f x(J> 

(i)  f (iya  1 (i)  / \ (i)a 

a-i  3=1  a-1 

is  the  ath  observation  in  the  jth  stage  from  II,..,  all  xiD 
~ — (1)  ( 1 )a 

(I  < i 1 1 ia  4n_. » j = 1,2)  being  independent.  Then 


xi . 
,1 


/m  where  X^  ? 

(i)a 


P (CS|P^>  = l P {X^  = X^M,  X***  > x[}\  - hVi(s; 
h 1 2 s^s  b (k)  [k]’  (l)  = [kj 


xa!<sw-h  Vifs;  V>  * x«)  v 1 c s) 


k-1 

+ l l P { 
“ L 11 


yU)  - y(D  y(D  > y(D  h 
. . O^o  bl‘v(j)  ' Ck]’  x(k)  = x[k]  " n» 

j <-  -j 


(5.2) 


y(D  > "(I) 

X(i)  = “DO 


X)tc  > X'rTA  - hViCs;  X)t;  < X'tn  - h V i { s; 


^r(l)  ^ y(l) 
v(i)  X[k] 


X(k)  > X(i ) ’ X(k)  > X(i)  V 1 C s} 


k-1 


= l A + I l B 
•o  s -7-0  s,3 

S(X6  3=1  sfcS. 


Denoting  [(x|^  - b[.-j)/n^/a,  (X..^  - b j- ^ -j ) v'm/o ] by  EXj.»Yj]*  we  see 
that  [X^,Yd  has  a standard  bivariate  normal  distribution  with  correlation 
coefficient  = /p  (1  < i <_k).  In  what  follows  we  shall  use  the  equality 


4>2[a,b|p] 


f * 

J — CO 


b-pz 

(l-p2)1/2_ 


d$(z) 


(5.3) 


where  4>2 C * » • | P 3 is  the  standard  bivariate  normal  cdf  with  correlation 
coefficient  p (-1  < p < 1). 


We  first  consider 


Ag  = + ^i*^/0  > Xi  ? \ + (i^i-h)»^i/o  V i p s, 

Xk  + > ^i  v i t s. 


Y.  + 6,  .rWo  > Y.  V i f si 
k ki  i 


= |f  n {$2[u  + 5^^/n^/o,  V + 6kii/m/o  I V^p] 

J _oo  J -ao  i£s 


- 4*2^  + <*ki~h),^i'/0*  v + fiki^°  I 


x II  4>[u  + (6,  .-h)</n7/a]d4>  [u,v|i/p] 
its  X1  1 * 


=rr  .ft 

J _0O  J -a>  ifS  1 J 


u+6.  . /n  /a 
ki  1 

u+(<5,  .-\\)Jn7/a 
ki  1 


(5.4) 


‘J’Cv/Zq  - zCp/q)1^  + 6,  . (m/q)1'/2/a]d<t(z) 


x n 4>[u  + (6  .-h)vrTr/o]d<I>0[u,v|^p] 

• L KI  -L  4 

l^s 


>00  SCO  I 

•n  \ 
-°°  iCa  I 


f x+6  .^T/o 
If  ki  1 


ki  1' 

x+(6^-h)'/n^/o 


*[y  + (x-z)(p/q)1/2  + 6.  . (m/q)1/2/a]d$(z) 

KX 


II  $[x  + (6  .-h)i4i  /a]d4>(y )d4>(>:) . 
« t,  KX  X 


The  next  to  last  equality  was  obtained  using  (5.3),  and  the  last  was  obtained 

1 / ^ 

by  making  the  transformation  x = u,  y = (v-urp)/(l-p) 

We  next  consider 
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3 . 
s,3 


= P {X.  - 5,  . /n'/a  > X,  > X.  - (6,  .+h)/nT /a, 
y j k]  1 K = j k]  1 


X.  - 6../n1/o  > X.  > X.  - (6.  .+h)/nT/a  Vi  f s' 
3 il  1 1=3  13  1 


X.  - (6..+h)/n7/a  > X.  V i t s 
3 13  1 l 


Y,  + 6,  . /m/a  > Y.  , Y,  + 6,  . /m/o  > Y.  V i C s } 
k ki  3 k ki  l 


(5.5) 


fCO  |>00 

co  J _ao 


x-6,  ,/nT/a 

k)  1 


K3  1 f”  _ 

i f II  (*9[x  - 6. ./n  /a,  w + 6 /m/o|/p] 

x-(<5,  ,+h)/n7/a  Jt-6.  . /m/a\ifs  1 J 

ki  1 ki  ' 


$2Hx  - (6  +h)/n^/a,  w + 6^^/m/o | /p] } 


x n $[x  - (6.  .+h)/^/aj)dO-(u,w|/p) 
i/s  ^ 1 / l 


di2(x,t | /p) . 


Proceeding  as  with  A^,  we  apply  (5.3)  to  (5.5)  and  then  make  the  transforma- 
tions x = x,  y = (t-x/p)/(l-p)^2  and  u = u,  v = (w-u/p)/(l-p)  ^ . 
Substituting  the  resulting  expression  and  (5.4)  in  (5.2)  we  obtain  (5.1). 


Corollary  5.1:  Let  vi_( 6 ) denote  any  f Q such  that  ] = 4^] 

where  6 > 0.  ( u ( 5 ) is  known  as  a slippage  configuration.)  Then  we  have 


- 6 


?u«>{cs|v = L Li 


- x+5/n^/a 


x+(6-h)/n^/a 


4>[y  + (x-z)(p/q)1/2  + 6(m/q)1/2/a]d$(z) 


Y-l 

+ flCx  + (<$-h)/n^/a]  ) d4>(y)d4>(x) 


(5.6) 


Lf 


+ (k-l) 


x-6/n7/o 


CO  J -00 


r C“ 

x-(6+h)/n^/a  ■'y+(x-u)(p/q)1'/2-5(m/q)1/,2/o 


4>[v  + (u-z)(p/q)1/2  + <S(m/q)1/2/a]dO(z) 


V x-h/n^/o 

+ t(x  - h/n7/a)^  d4'(v)d4>(u) 


\k-2 


d4>(y  )d4>(x) . 


1'i 


Proof;  The  proof  is  straightforward. 

Remark  5.1:  For  k = 2,  (5.6)  simplifies  considerably;  the  resulting  expres- 

sion is  given  by  Alam  [1970]  as  his  equation  (3.1).  Alam  also  gives  an 
expression  for  P^^tCSjP,,}  (see  his  second  equation  (3.24));  however,  we  were 
not  able  to  verify  his  expression. 

5 . 2 LF-conflguration  for  P^ 

In  order  to  solve  (4.4)  or  (4.5)  it  first  is  necessary  to  determine  the 
LF-configuration  of  the  y^  for  P^,  i.e.,  to  determine  any  y^  f fi(£!C) 

such  that 

P (CS|P  > = Inf  P {CS|P  }.  (5.7) 

Ho  2 uffi(6*)  - 

As  a first  step  toward  determining  y^  we  now  study  the  monotonicity  of 

P (CS|P0}  w.r.t.  the  yr.n  (1  <.  i < k). 
y_  1 2 LiJ  — 

Lemma  5.1:  For  fixed  k and  (1  i 4 k~l)  and  fixed  (n^,n2,h), 

P^(CS | Pj)  is  non-decreasing  in 

Proof:  From  (5.4)  and  (5.5)  we  have 


py (cs | P2>  = Py{  U CtO^  + 6kir^/o  > xi  > \ + (6ki"h)’^T/a  v 1 € s’ 

Xk  + (5ki~h),An?a  > Xi  v 1 * s) 

U .U  (Xj  _ 6kj*/^l/a  > Xk  > Xj  ‘ (6kj+h)’'^l/0’  (5.8) 

3fs 

X.  - 6 . . > X.  > X.  - (6..th)^7/o  V i f s,  i t j; 

1 H 1 i 1 il  1 

X^  - (6ij+h)/n^/o  > X.  V i (f  s)}  n (Yk  + S^vWa  > Y£  Vif  s}]} 
= P{A(y)> 
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where  A(vO  is  in  the  sigma  algebra  generated  by  the  r.v.'s  [X^,Y^] 

Now  consider  a vector  u_'  = (y*  , ...,y')  where  Ur»-i  = Pr-l  (1  < i < k-1) 

i.  K Li  J Li  J 

and  > Then  P^,{CS|P^}  = P{A(_y')}.  We  shall  show  that 

A( ) pA(ji).  We  denote  the  value  taken  on  by  a r.v.  X at  a sample  point  to 

by  X(u).  Also  let  6^  = - y[j]  ^ i k).  Then  6^  ^ > 6^ 

(1  < i <k-l)  and  6 ! . = 6 . . (l<i,j<k-l). 

Fix  to  f A(y^)  which  corresponds  to  some  set  s C S. 

Case  1:  Suppose  that  to  belongs  to  the  following  event : 

{Xk(oj)  + 6^. v^/a  > X^(co)  > X^Cco)  + (5^^-h)i4i^/cr  V i f s, 

X^(co)  + (d^-hjvn^/a  > X^(u>)  V i ( s). 

Then  it  also  belongs  to  the  event 

{ X^Cu) ) + (S^Pn^/a  > X^(to)  > X^Cu)  + (6^-h)v'n7/o  Vic  s', 

X^(u>)  + (6jV-h)/n^/a  > X^(to)  V i t s'} 

for  some  set  s’  f S,  s'  c s . 

Case  2 : Suppose  that  s is  non-empty  and  for  some  j f s,  to  belongs  to  the 

following  event : 


{X.(to)  - 6,  .Si vT/ct  > X,(to)  > X.(to)  - (5  •+h)*/n7/a, 
1 k]  1 k ] k]  j. 


(5.9) 


X^(to)  - 6^/rvj7a  > X^(to)  > X^(u)  - (6^+i;  )/n^/a  V i f s,  i i j ; 
Xj(<o)  - (6„+h)PrTJ7o  > X^(to)  V i ( s). 


%v. 


1G 


Now  suppose  that  <5^.  in  (5.9)  are  replaced  by  5^..  and  that 

X_.(u>)  - 5^j^nJ/o  > X^(io)  is  still  satisfied.  Then  (5.9)  holds  with  6^ 

replaced  by  s'. . (l<i,j<k)  and  s = s' . On  the  other  hand,  if 

ID 

X.(u>)  - <5 ' . </n~/o  > X.  (a.)  is  violated,  then  u must  belong  to  the  following 
D *3  i K 

event : 


(X^(w)  + 6^  t-n^/o  > X^(w)  > Xj^(to)  + (6^i.-n)v^’/a  V i £ s', 

X,  (w)  + (s'  . -h)/n 7/ a > X.(to)  Vi^s'} 
k ki  1 l 


for  some  s'  c s . 

From  Cases  1 and  2 we  have  s'  C s,  and  hence  we  obtain 


{Y  (w)  + <5  . /n  /a  > Y.(u>)  V i P s) 
k ki  1 l 

-=>  {Y  (w)  + 6.  . /nT/o  > Y.(io)  V i f s'}, 
k ki  1 l 


Therefore  w C h(v)  -“=>  w C A(y/ ) and  A(u^)  c A(jj' ).  Hence  P{A(p/)}  >.  P{A(i£)} 
and  P^,{CS|P2}  ^P^iCS^}  which  completes  the  proof  of  the  lemma. 

Corollary  5.2:  P ^ ^{CS | P2>  is  non-decreasing  in  5^0  when  = ^[k-1] 

is  fixed.  In  particular,  for  k = 2 P^{CS|P2>  achieves  its  infimum  over 

fi( 5*),  i.e.,  satisfies  (5.7),  at  any  p satisfying  p^j  “ Pp]  = 6*- 

Proof : Since  is  translation  invariant,  P^{CS|P2)  depends  on  p^  only 

through  the  6^  (1  < i < k-1),  The  result  then  follows  from  Lemma  5.1. 

Remark  5.2:  The  method  of  proof  used  for  Lemma  5.1  does  not  carry  over  to 
prove  our  intuitive  conjecture  that  P^{Cs|P2)  is  non- increasing  in  each  Vq] 
(i  t k)  when  the  remaining  p-values  remain  fixed;  nor  have  we  been  successful 
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in  finding  alternative  methods  of  proof.  However,  Monte  Carlo  samplings  that 
we  have  conducted  have  supported  this  conjecture.  Nonetheless,  the  monotonicity 
of  P {CS|P„}  in  the  5,  . (1  < i < k-1)  for  k > 2 still  remains  an  open 
question.  We  believe  that  the  following  conjecture  made  by  Alam  [1970]  is  correct 

Conjecture  5.1:  For  fixed  k > 2 and  (n^n^h),  the  slippage  configuration 

u(<5*)  is  a LF-configuration  for  P . 

5.3  Lower  bound  for  P {CS|P„} 

y i 2 

In  this  section  we  derive  a lower  bound  for  P {Cs|P_}.  This  lower  bound 

U 1 2 

will  prove  to  be  particularly  useful  for  k > 2 since  we  will  prove  that  it 
achieves  its  infinum  over  0(6*)  at  y(6->),  the  conjectured  LF-conf iguration 
for  P^i  this  result  will  permit  us  to  construct  a conservat ive  2-stage 
procedure  (for  k > 2)  which  will  guarantee  (2.1).  The  lower  bound  involves 
integrals  the  values  of  which  can  be  easily  calculated  on  a digital  computer. 

Theorem  5.2:  For  any  p f C we  have 

fk-1  k-1 

P {cs|P2>  > ( H *[x  + (6ki+h)^/o]  + n 

— ^-»vi=l  i=l 


$[x  + 


6.  . /n/o 
ki 


d$(x)  - 1.  (5.10) 


J 


Proof: 


1 - P {CS|P„}  = P {Incorrect  selection  I P0} 
y 1 2 y_  '2 

< P (X/,1?  < X^.  - h for  some  i i k)  + P {X,.  . 
= y (k)  (x)  y (k) 


= 1 - P {xii!  > X^J  - h V i / k}  + 1 - P (X.,  . 
y (k)  = (1)  y (k) 


ra>  k-1  f00  X-J. 

n $[x  + (6,  .+h)*/nT/a]d$(x)  - II 

J-co  i=i  ki  l i=1 


r 00  k-1 


< X...  for  some  i i k) 
(i) 

i *(i)  v i i k) 

$[x  + 6k^*^m/o]d$(x) . 
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A rearrangement  of  the  terms  gi.ves  the  desired  lower  bound. 


Corollary  5.3:  For  all  u P f2(6*)  we  have 


>co 

p {CS|P2>  > {4>k_1Cx  + (6*+h)/n^/o]  + 4>k_1[x  + 6*/m/a]}d4>(x)  - 1.  (5.11) 

— _oo 


Proo£:  The  proof  follows  immediately  on  noting  that  the  r.h.s.  of  (5.10)  is 

non-dscreasing  in  each  6,  . for  1 <_  i < k-1. 

Corollary  5.4:  Since  the  r.h.s.  of  (5.11)  is  strictly  increasing  in  each  of 

n7  , m,  h,  and  -*-1  as  n^  or  as  n„  and  h -*■  ",  we  see  that  (2.1)  can 
be  guaranteed  if  all  are  chosen  sufficiently  large . 

As  a consequence  of  Corollary  5.4  it  is  clear  that  a conservative  two-stage 
procedure  which  guarantees  (2.1)  and  which  employs  either  the  R-minimax  or  the 
U-minimax  design  criterion  can  be  constructed  and  implemented  using  the  lower 
bound  given  by  the  r.h.s.  of  (5.11).  We  shall  denote  such  procedures  which 
employ  these  criteria  by  P2(RC)  and  ^UC^’  resPec'tively • P2(R^) 
conservative  relative  to  P^R^, ) (as  is  P^tU^,)  relative  to  ?2(U^))  since 
it  ovcrprotects  the  experimenter  with  respect  to  (2.1),  this  overprotection  being 
purchased  at  the  expense  of  an  increase  in  E^{t|P2(Rc)}  and  E^{T|P2(Uc)}  at 
y(6*)  and  p_(0),  respectively.  We  consider  P2(R^)  and  P2(Ur,)  in  detail 
in  Sections  7-9. 


Remark  5.3:  If  we  let  h -*■  « on  the  r.h.s.  of  (5.10)  we  obtain 

roo  k-1 

IT  ^Cx  + 6,  . *Wa]d<5(x)  which  is  an  expression  for  P {CSlP,  } where  P 
i-i  kl  H ' 1 1 

uses  a common  single-stage  sample  size  m per  population.  Thus  P^  is  a 

special  case  of  any  P^(‘)  based  on  the  conservative  lower  bound. 


6.  Expected  total  sample  size  for  P 

In  order  to  solve  either  (4.4)  cr  (4.5)  we  first  find  an  analytical 

expression  for  E {TjPg};  this  is  done  in  Section  6.1.  Then  it  is  necessary 

to  determine  Sup  E {T|P„}  for  y f fi(6*)  and  for  y f Q for  (4.4)  and  (4.5), 
y z.  — 

respectively;  the  sets  of  y-values  at  which  these  suprer.ia  occur  are  found  in 
Section  6.2. 

6.1  General  expression  for  E^{t|P„} 

Our  result  concerning  a general  expression  for  E^{T|?2)  is  summarized 
in  the  following  theorem: 

Theorem  6.1:  For  any  y_  ( ft  we  have 


E {T| P _}  = 

y 1 2 


(6.1) 


k r k k ^ 

kn  + n„  \ / n $[x  + (6 . .+h)v''nT/c]  - H $[x  + (6 . .-h)t/nT/o]>d$(x) . 

1 ^ i=i  = i 13  1 j=l  1J  1 J 

j^i  j^i 


Proof : 


E,{T|P2}  = knx  + n2Ep{s|P2) 


kni  + n2[Eii{S'|P2)  - P^( S ' = 1|P2>] 


(6.2) 


kni  + n2 


.1.  i h v it  ° 

_l-x  — 

- X y*u!  i s1s! ♦ h a 3 * » 


Theorem  6.1  follows  immediately. 


2C 


6.2  The  supremum  of  E^{T|P?} 

Our  results  concerning  the  supremum  of  E^{TjP0}  for  p_  f n(6ft)  and 
P f £1  are  summarized  in  a)  and  b)  of  the  following  theorem. 

Theorem  6.2:  For  fixed  k and  we  have  that 


a) 


Sup  E {T|P  } = kn 

y - 


+ n. 


/Oo 

{$>  *[x  t (6*+h)/n^/a3  - 4>k  ^[x  + (6*-h)/^/o]}d<I>(x) 

j —00 


•CO  0 

(k-1)  { $A  ‘‘Lx  + h/rT|7a3'>C 

-oo 


(6.3) 


x - (6*-h)y'n^/aj 


- ^Cx  - ht/n^7o]4'[x  - (6!’!+h)/n1/o ]}d4>(x) 


which  occurs  when  = Uq.  - p^-,  - 6* 


,00 

b)  Sup  E {T|P2}  = kn.^  + kn2  i {4>k  ~[x  + h/n7/a]  - 4>^  ^[x  - h/n^/aDid^x)  (6.4) 
p££)  — ' -oo 


which  occurs  when  = u[p]’ 

Me  shall  prove  part  b)  of  Theorem  6.2;  the  proof  of  part  a)  follows  along 
the  same  lines. 


Proof:  Gupta  [19G5]  has  shown  that  E {S'[P^}  achieves  its  supremum  for 

\t_  £ (1  when  = p^y  It  only  remains  to  show  that  P^{S'  = 1 } achieves 

its  infimum  when  = Pj-^]’  ^e  use  the  method  of  Gupta  Cl965]. 

Set  4q-j  = ...  = ^[b]  = 4 < 4[b+i'j  ^or  sorne  b (1  4 b 4 k-1)  and 

define  6.  = pr.,  - p for  b+1  < i < k.  Define 
i Li]  “ “ 


Q(li)  = Py{3'  = l|P2;  PCl]  = ...  = wLb]  = u> 


[CO  K 

= b 1Cx  - hi/nT/cr]  II  $[x  - (6  .+h)vn~/o]d<P(x) 

J-CC  1 j^b  + 1 3 

k f®  k 

+ £ I 4>  [x  + (6.-h)/n  /o]  II  *[x  + (6  . . -h ) /n7/o ]d$ ( x ) . 

i-b+1  ■’-»  11  j=b+l  13 


After  differentiating  w.r.t.  p,  and  then  interchanging  the  order  of  inte- 
gration and  sunmation  in  the  first  term,  and  making  appropriate  substitutions , 
we  obtain 

<t  Cx  + (6  .-h)/n7/al  II  <J>[x  + (6  . .-h)/nT"/o] 

1 L j=b+l  13 

j^i  (6.5) 

•{$Cx  - hi4rj7a]<j>[x  + - <J>[x  + (5^-h)/n^/a](f (x)}dx  <_  0. 


dQ  = 

dp 


b/n„ 


k 

I 

i-b+1 


The  last  inequality  is  obtained  by  noting  that  the  quantity  inside  { } in 

(6.5)  is  non-positive  for  every  x and  i for  b+1  < i < k due  to  the 

monotone  likelihood  ratio  property  of  <}> . It  follows  that  Q is  non- increasing 

in  p and  is  in  fact  strictly  decreasing  if  h,n^  > 0.  Thus  subject  to 

pr  = ...  = p0,,  we  see  that  P {S'  = l|P0}  is  minimized  by  increasing 

LI  J LuJ  p_  * 

the  common  value  p until  p = h[b+l]'  Since  this  is  true  for  each  b k-1, 
it  follows  that  P {S'  = l|P„)  is  minimized  and  hence  E {TIP-}  is  maximized 

p 1 2 p_  1 2 

over  n when  Uqjj  = P^]* 

Using  the  results  of  Theorem  5.1  and  Lemma  5.1  along  with  Theorem  6.2  we  can 


now  formulate  our  optimization  problems  (4.4)  and  (4.5)  precisely. 


In  this  section  we  consider  the  optimization  problems  (4.4)  and  (4.5) 

which  one  must  solve  in  order  to  determine  (n^ ,n0 ,h| R^,)  and  (n^,n?,h[U^,) 

which  are  necessary  to  implement  P (Rr)  and  As  note<^  Section  5.2, 

we  have  not  been  successful  in  determining  the  LF-conf iguratior.  of  the  Uq-j 

(1  < i < k),  except  for  k = 2.  Thus  for  k > 2 we  replace  the  exact 

Drobability  Inf  P {CSjP^}  by  the  conservative  lower  bound  on  that 
Uffi(6*)  - 

probability  given  by  the  r.h.s.  of  (5.11),  and  consider  the  following  optimiza- 
tion problems : 

7 . 1  Discrete  optimization  problems 

7.1.1  R-minimax  design  criterion 
For  given  k and  specified  choose  (n^,n^,h)  to 


minimize  Sup  E (T)P,} 

uea(«*)  - 

(7.1) 

fCO 

subject  to  {<t^'  1[x  + (<5*+h)/n^/aj  + 0^  1[x  + 6*/m"/a]}d<?(x)  - 1 >_  P*, 


where  n^»n2  are  n°n-negative  integers  and  h >.  0. 


In  (7.1),  Sup  E {T|P0}  is  given  by  (6.3).  We  denote  by  (n  ,n  ,h|R  ) 

ufn(6*)  2 

the  solution  to  (7.1),  and  regard  it  as  a conservat ive  solution  to  (4.4);  we 


denote  the  corresponding  procedure  by  P2(R,-,). 


7.1.2  U-minimax  de sign  criterion 
For  given  k and  specified  {6*,p*>  choose  (n^^jh)  to 


r 

■y 

t 


4.  ^ 


minimize  Sup  E {T  j P } 
— 


(7.2) 


subject  to 


{4^  ^[x  + ( 6*+h ) v/n^/o ] + 4^  1[x  + 6*/m  /o])d4>(x)  - 1 > P*, 


where  ,n^  are  non-negative  integers  and  h >=  0 . 


In  (7.2),  Sup  E { T | P } is  given  by  (6.4).  We  denote  by  (n  ,n?,h|U  ) the 

solution  to  (7.2),  and  regard  it  as  a conservative  solution  to  (4.5);  we  denote 
the  corresponding  procedure  by  P^(V^/. 

7 . 2 Continuous  optimization  problems 

The  problems  (7.1)  and  (7.2)  are  extremely  complicated  integer  programming 
problems  with  non-linear  constraints  and  objective  funct5.ons.  Although  these 
problems  can  be  solved  in  principle  by  enumeration,  the  search  is  likely  to 
be  a costly  one  because  of  the  numerical  evaluation  of  the  integrals  involved. 
Additionally,  since  the  solution  depends  on  6*,  a separate  solution  is 
required  not  only  for  each  k and  P*-value,  but  also  for  each  6*.  Hence 
we  shall  remove  the  restriction  that  n^  ,n2  m^st  be  integers ; we  reparametrize 
the  problem  and  regard  the  new  design  constants  (which  are  functions  of  n^, 
n^,  and  h)  as  continuous . We  use  this  continuous  version  as  a large  sample 
approximation  to  the  discrete  version. 

We  define  the  new  design  constants 

= 6*/n^/a,  c2  = 6*/n^/a , d = hSn^/a.  (7.3) 

We  note  that  the  exact  expression  for  P^{CS|P2)  and  the  conservative  lower 
bound  on  it,  a3  well  as  (T ( P2 } depend  on  (n^n^h),  6*,  o only  through 
(c1,c2,d)  and  6,^/6*  (1  < i < k-1). 


r w 


Thus,  for  example,  for  given  k and  specified  {6*,P-'!}  we  can  approxi- 
mate the  design  constants  (n^,n2,h|uc)  necessary  to  implement 
solving  the  continuous  optimization  problem: 


(7.4) 


00 

minimize  kc^  + kc^  {ft**  '*'(x+d)  - ^(x-d) }d<J>(x) 

—00  s **  . 

(7.4 

subject  to  f {<t^  ^(x+c  +d)  + 4>K  \x  + (c^+c^)1^ )}d<f(x)  - 1 P* 

J _QD  ^ 


where  c^,  c^,  d >_  0. 


We  denote  by  (c^,c0,djuc)  the  solution  to  (7.4),  and  use  the  approximate 


design  constants 


"l  = (sH  + 1 * ^2 


Xf  + lL  h = ^ (7.5) 


where  [z]  denotes  the  greatest  integer  < z,  to  implement 

Similarly,  for  k = 2 and  specified  {6*,P*}  we  can  approximate  the 

design  constants  (n  ,n  ,h|U  ) necessary  to  implement  P^(U  ) by  solving 

X 2 L Z l> 

the  continuous  optimization  problem: 


minimize  2c^  + 2c2{0(d//2 ) - $(-d//2)} 


(7.6) 


(c1+d)//2  

subject  to  4>C(c  -d)/»^]  + [ 4>  [-x/p/q  + *//(c^+c^)/2q]d$(x)  .>  P* 

J(c,-d)//2  1 - 


where  c^,  c2,  d >,  0. 


A A A . 

We  denote  the  solution  by  (c^c^djUj.).  Analogous  expressions  can  be  written 


%>**  tmi 


1 


in  order  to  approximate  the  design  constants  (n^,n2,h|Rc)  for  k >.  2,  and 

(ni’n2’^I^E>  f°r  k = 2> 


3 . Constants  to  implement  P^ 

8.1  Constants  to  implement  an^  P0(RE)  for  k = 2 

Table  1 contains  constants  necessary  to  approximate  (n^  ,n,2  ,n| U^,)  and 

(ni ,n2 ,hl RE>  ^cr  k = 2 and  selected  P*;  although  we  are  primarily 

interested  in  the  ones  associated  with  P (U  ),  we  have  computed  those 

Z L 

associated  with  P2(RE>  for  comparative  purposes.  (See  Section  9.)  The 
computations  for  P^U^)  are  t*ie  solutions  of  (7.5),  while  those  for  P^R^) 
are  the  solutions  of  the  analogous  nroblcm  wherein  Sup  E {Tip  } over  fi(6*) 

y.  2 

is  minimized.  The  constants  given  here  for  P0(U_,)  and  P2(Rj,)  are  exact 
since  the  LF-configuration  for  P {CS}  is  known  for  k = 2. 

n 


8.2  Constants  to  implement  P„(U„)  for  k >_  3 

Table  2 contains  constants  necessary  to  approximate  (n^n^h]!^)  for 

k = 3,4,5,10,15,25  and  selected  P*;  the  computations  for  P^U^)  are  the 

solution  of  (7.4).  The  constants  given  here  for  P2(UC)  are  conservative 

since  the  LF-cor figuration  for  P {CSlP.}  is  unknown  for  k > 3.  (We  have 

y_  1 2 = 

not  attempted  to  compute  the  constants  (c  ,c  ,d|U  ) which  would  be  used 

1 Z L 

if  the  conjectured  LF-configuration  for  P^{CS|P0}  were  indeed  4^^  - Wqt  = 
(1  <.  i <_  k-1)  for  k >_  3;  such  computations,  although  of  interest,  would 
be  difficult  to  carry  out  because  it  v/ould  be  necessary  to  evaluate  numeri- 
cally very  complicated  iterated  integrals . ) 

All  of  the  computations  for  Tables  1 and  2 (as  well  as  those  described  in 
Section  9)  were  carried  out  in  double  precision  arithmetic  on  either  Cornell's 


IBM  360/65  and  IEM  370/168  or  on  Northwestern's  CDC  6400.  To  solve  the 


T le  1 

Constants  to  implement  an<^  ^or  ^ r ^ 


(£i,a2,d|uE) 


(Cj.S2.diRg) 


0.9999 

4.5397 

2.9087 

0.97215 

3.4801 

1 

4.4120 

1.9162 

0.9995 

3.9742 

2.6708 

0.95623 

3.1239 

3.9034 

1.6992 

0.999 

3.7062 

2.5712 

0.94824 

2.9566 

3 . 6506 

1.6026 

0.99 

2.7189 

2.0906 

0.91913 

2.2931 

i 

2.7371 

1.2803 

0.95 

1.8621 

1.6155 

0.83072 

1 

| 1.6583 

1.9347 

1.0574 

0.90 

1.4270 

1.3132 

0.85278 

1.3224 

1.4996 

0.92974 

0.85 

1.1391 

1.0930 

0.84174 

1.0789 

1.1830 

0.90951 

0.80 

0.91577 

0.90970 

0.82702 

| 

0.88255 

| 

0.96174 

0.87132 

0.75 

0.72801 

0.74161 

0.31999 

0.71036 

0.77072 

0.84413 

0.70 

0.56240 

0.58661 

0.80783 

0.55468 

0.59769 

0.82505 

0.65 

0.41227 

0.43299 

0.00202  j 

0.40845 

0.43957 

0.81002 

0.60 

0.26982 

0.20775 

0.79714  i 

0.26907 

0.2C914 

0.79865 

0.55 

0.13374 

0.14322 

0.79140 

0.13378 

1 

0.14313 

0.79053 

Table  2 


Constants  to  implement  f°r  ^ ^ 


k 

P* 

(c 

J 

1 . 
C1 

I r~ 
C2 

d 

0.99 

2.9326 

2.4083 

1.2458 

0.95 

2.0893 

1.8974 

1.6303 

0.90 

1.6699 

1.5491 

2.1814 

0.75 

1.0492 

I 0.97980 

3.9335 

0.99 

3.0432 

2.5805 

1.2596 

4 

0.95 

2.2400 

1 2.1105 

1.4806 

0.90 

1.8262 

1.7859 

1.8245 

0.75 

1.2203 

1.1712 

3.2365 

0.99 

3.1035 

2.7301 

1.2712 

0.95 

2.3184 

2.2622 

1.4604 

5 

0.90 

1.9209 

1.9786 

1.6403 

0.75 

1.3191  | 

1.3304 

2.7290 

0.60 

0.96047 

j 0.91856 

4.3038 

0.99 

3.2364 

3.1620 

1.3453 

0.95 

2.5094 

2.7750 

1.3529 

10 

0.90 

2.1466 

2.5349 

1.3830 

0.75 

1.5712 

1.9725 

1.6980 

0.60 

1.2248 

1.3491  | 

2.7648 

0.45 

0.95459 

0.92770  1 

. 

4.3433 

0.99 

3.2983 

3.3883 

1.3999 

0.95 

2.5886 

3.0259 

1.3771 

15 

0.90 

2.2344 

2.8212 

1.3676 

0.75 

1.6899 

2.4268 

1.3974 

0.60 

1.3404 

1.7600 

2.0210 

| 

0.45 

1.089  7 

1.1270 

3.5471 

0.99 

3.3634 

3.6572 

1.4783 

0.95 

2.6646 

3.3204 

1.4401 

25 

0.90 

2.3270 

3.1393 

1.3972 

0.75 

1.8000 

2.8539 

1.3234 

0.60 

1.4909 

2.7798 

1.1769 



0.45 



1.3026 

3.1516 

0.76886 

continuous  optimization  problems,  first  a "reasonably  good"  discrete  optimal 
solution  was  found  by  a search  method.  This  solution  was  used  as  an  initial 
guess  in  the  computer  program  using  a modified  version  of  the  steepest  descent 
method  to  solve  the  continuous  non-linear  programming  problem;  Algorithm  304 
of  Hill  and  Joyce  [1967]  was  used  to  evaluate  <t>(  * ) ; the  integrals  were 
evaluated  using  the  Romberg  method  of  integration.  We  do  not  claim  that 
our  solutions  represent  the  absolute  optima,  but  they  are  reasonably  close  to 
the  optima.  (The  E^{T J P2 ( ) } -surface  is  very  flat  in  the  neighborhood  of 
the  maximum  for  P*  1/k  since  P2(UC)  ^1’^  The  ta^uiate^  values  should 
be  correct  to  the  number  of  significant  figures  given. 


9 . The  performance  of  P 2 relative  to  P 

As  a measure  of  the  efficiency  of  P (Bechhofer  [1954])  relative  to 

that  of  P2  when  both  guarantee  the  same  probability  requirement  (2.1),  we 

consider  the  ratio  (termed  relative  efficiency  (RE))  E^{T|P2>/kn  where 

2 * 

n = [(oa/6,:)  + 1],  and  c is  the  solution  of 


^(x+c)d*(x) 


Pft. 


(9.1) 


Clearly  RE  depends  on  p_  and  {5*,P*};  values  of  RE  less  than  unity  favor 
P2  over  P^.  For  mathematical  convenience  we  shall  use  the  continuous 
approximations  to  E^{T|P2)  and  n (thereby  ignoring  the  fact  that  the  sample 
sizes  must  be  integers).  RE  is  then  given  by 


k 


l 

i=l 


$(x+d+6. .c,/6*) 
13  i 


k 

n 


j=i 


♦(x-d+5ijC1/6*)> d$(x) 


/kc 


(9.2) 


t vfo 


2Cf 


where  we  employ  in  (9.2)  either  (c^c^dlP^Uj.))  for  k = 2 or 
^C1’C2’^^2^UC^  ^or  or<^er  to  compare  the  performance  of  P^U^,) 

with  that  of  P2(Uj,)  ^or  ^ = 2,  WQ  also  will  employ  (c^ .c^ ,d | Pjd),.,) ) for 
k = 2.  See  Table  3.)  The  value  of  c in  (9.1)  has  been  tabulated  for 
selected  k and  P*  by  Bechhofer  [1954],  Gupta  [1963],  and  Milton  [1963] 
(Bechhofer's  A = c,  Gupta's  and  Milton's  H = c//2. ) 

Pomark  9.1:  For  the  equal  means  (EM)  configuration  = p j- ^ -j , and  for 

the  u(6*)  configuration  - 6*  (known  to  be  LF  for 

P„(U_)  for  k = 2 and  for  P (U ) for  k >.  2,  and  conjectured  to  be 
z u z c 

LF  for  ^2^UE^  ^°r  ^ > 3),  we  note  that  RE  depends  only  on  k and  P* 

for  given  P2(0  an^  4.  ^ 

Table  3 which  concerns  P2(Ur)  and  P2  ( U^, ) for  k = 2,  and  Table  4 

w'hich  concerns  P0(U  ) for  k >_  3,  give  computed  RE-values  for  selected  P* 

and  p_  F n to  indicate  the  magnitude  of  the  saving  in  E{TjP}  achieved 

u 

by  the  screening  property  of  P„(U^)  (P„(U„))  when  P„(*)  is  used  in 

4 L / L Z 

place  of  P^  for  k > 2 (k  = 2);  the  computations  for  Tables  3 and  4 are 

A A A 

based  on  (c, ,c  ,d)  listed  in  Tables  1 and  2,  respectively. 

X X 

9.1  P2(U£)  and  P2(VQ)  vs.  P for  k = 2 

For  all  P*  we  note  that  RE  ...  is  less  for  P0(U  ) than  for  P„(U  ), 

p_(  0 ) 2 E l C 

but  RE  , . is  greater  for  P„(U_)  than  for  P_(lh,)  (since  n,  for 

p\°°;  i L ^ L X 

P2(tJj.)  turns  out  to  be  greater  than  n^  for  P2(U^)).  The  range  of 

(p[2]  - y[i;])/6*  = 6/6*  values  over  which  RE  ^ is  iess  for  P^U^,) 

than  for  P2(UC)  ^or  ^iven  P*  appears  to  depend  critically  on  P*  being 

small  for  P*  close  to  unity  and  large  for  P*  -*•  1/2  (since  in  this  latter 

situation  P2(UC)  P^)*  However,  of  greatest  importance,  is  the  fact  that 

for  either  P^U^,)  or  ^2^C^  usec^  at  any  P*  (l/2  < P*  < 1)  we  have 


Table 
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0.8300 
0.3366 
0.8518 
0.9171 
0. 9306 
0.9994 


0.7920 


0.8504 

0.9389 

0.9934 


0.7472 


0.7683 

0.8326 


0.6409  0.5916 
0.6778  0.5698 
0.7174  0.5695 
0.8323  0.6032 
0.9496  0.6968 
0.9970  0.8879 


0.6178  0.5662 
0.6463  0.5376 
0.6769  0.5296 
0.7653  0.5423 
0.8864  0.5820 
0.9831  0.6905 


0.5913  0.5381 
0.6109  0.5023 
0.6333  0.4914 
0.6954  0.4878 


0.7769 
0.9088 


0.5204 

0.6306 


0.5911 

0.5661 

0.5610 

0.5740 

0.6278 

0.7497 


0.5659 

0.5353 

0.5244 

0.5263 

0.5477 

0.6195 


0.5379 

0.5009 

0.4885 

0.4790 

0.5042 

0.6058 


0.5659 

0.5352 

0.5236 

0.5211 

0.5309 

0.5826 


0.5379 

0.5008 

0.4881 

0.4766 

0.4980 

0.5951 


0.5659 

0.5352 

0.5236 

0.5209 

0.5278 

0.5835 

0.5598 

0.5169 

0.4959 

0.4639 

0.4558 

0.4940 

0.5379 

0.5333 

0.5008 

0.4892 

0.4881 

0.4799 

0.4766 

0.4449 

0.4976 

0.4497 

0.5940 

0.5249 
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KE  , < 1 for  all  y € SI  and  RE  , . <<  1.  Thus  both  P„(U„)  and 

y_(5)  — y_(")  2 E 

V 2 ( ) are  highl^ effective  as  screening  procedures. 

9.2  P2(UC)  v£.  P^  for  k >.  3 

For  given  k >.  3 the  performance  of  P^U^)  relative  to  that  of  P^ 
as  measured  by  RE  for  1/k  < P*  < 1 and  u e Si  is  similar  to  that  noted 
for  k = 2.  In  addition,  if  we  regard  RE  as  a function  of  k for  fixed  P* 
and  configuration  of  the  y . (1  < i < k),  specifically  for  the  configurations 
y_(°h  vij-k-j  - Up^-j  = 6*,  - y = 6 (2  < i < k-1,  0 < 6 < »); 


y[k]  ' U[k-1] 


= ",  our  computations  indicate  that  RE  is  decreasing  in  k 
(although  this  has  not  been  established  analytically).  Thus  the  effectiveness 
of  Pj ( Uc ) as  a screening  procedure  appears  to  be  increasing  with  increasing 


k. 


10 . Directions  of  future  research 

The  most  important  unsolved  problem  associated  with  ?2  is  that  of 
determining  the  LF- configuration  of  the  y^  (see  (5.7))  for  k > 2;  as 
noted  earlier,  we  conjecture  the  answer  to  be  the  slippage  configuration 
y(5*).  If  this  conjecture  can  be  shown  to  be  true,  it  will  be  necessary  to 
find  efficient  algorithms  for  evaluating  (^.,c)fCSj  ^as  S^ven  by  (5.6)) 
numerically  before  the  design  constants  ^^»c2,djUj,)  for  use  with  P2(lE.) 
can  be  determined. 

A two-stage  minimax  screening  procedure,  analogous  to  P2,  can 
devised  for  selecting  the  smallest  (say)  normal  variance;  this  was  done 
in  Tamhane  [1975b].  However,  design  constants  to  implement  the  procedure 
must  still  be  computed. 

V 


%/r 


A 


Procedure  ? given  by  (4.1)  permanently^  eliminates  populations  for 
which  - h.  However,  can  be  modified  in  such  a way  that 

populations  from  which  a total  of  only  n1  observations  are  taken,  are 
eligible  for  selection  as  "best"  along  with  those  from  which  a total  of 
nl  + n2  observations  are  taken;  in  this  modification  we  assert  that  the 
population  associated  with  max{max  X.,  max  X.^}  is  best.  Such  a procedure 

i£l  1 #1  1 

was  considered  in  Tamhane  [1975a j;  an  exact  analytical  expression  for  the 
PCS  was  derived,  and  the  PCS  performance  was  studied  by  Monte  Carlo  sampling 
methods.  Aside  from  the  analytical  and  computational  difficulties  (as  well 
as  the  problem  of  determining  the  LF- configuration  of  the  u^),  procedures 
of  this  type  would  appear  to  represent  a fruitful  direction  of  generalization. 
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requirement  using  the  indifference-zone  approach  as  does  the  single-stage  pro- 
cedure of  Bechhofer  -£±954].  The  two-stage  procedure  has  the  highly  desirable 
property  that  the  expected  total  number  of  observations  required  by  the  proce- 
dure is  always  less  than  the  total  number  of  observations  required  by  the 
corresponding  single-stage  procedure,  regardless  of  the  configuration  of  the 
population  means.  The  saving  in  expected  total  number  of  observations  can  be 
substantial,  particularly  when  the  configuration  of  the  population  means  is 
favorable  to  the  experimenter.  The  saving  is  accomplished  by  screening  out 
*non-.contending'^  populations  in  the  first  stage,  and  concentrating  sampling 
only  on  '’contending'*  populations  in  the  second  stage.  The  two-stage  procedure 
can  be  regarded  as  a composite  one  which  uses  a screening  subset-type  approach 
(Gupta  [1956],  [1965])  in  the  first  stage,  and  an  indifference-zone  approach 
-(BecWtofer  [1954])  applied  to  all  populations  retained  in  the  selected  subset 
in  the  second  stage.  Constants  to  implement  the  procedure  for  various  k and 
P:V  are  provided,  as  are  calculations  giving  the  saving  in  expected  total 
sample  size  if  the  two-stage  procedure  is  used  in  place  of  the  corresponding 
single-stage  procedure. 


